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^^ , Abstract. Generalized traveling salesman problem (GTSP) is an ex- 

5-H ' tension of classical traveling salesman problem (TSP), which is a combi- 

*~(| natorial optimization problem and an NP-hard problem. In this paper, 

an efhcient discrete state transition algorithm (DSTA) for GTSP is pro- 
posed, where a new local search operator named K-circle, directed by 
^vjj' , neighborhood information in space, has been introduced to DSTA to 

shrink search space and strengthen search ability. A novel robust up- 
date mechanism, restore in probability and risk in probability (Double 
\^ , R-Probability) , is used in our work to escape from local minima. The 

^^ ' proposed algorithm is tested on a set of GTSP instances. Compared 

with other heuristics, experimental results have demonstrated the effec- 
tiveness and strong adaptability of DSTA and also show that DSTA has 
C^ ' better search ability than its competitors. 

Keywords: Generalized traveling salesman problem, Discrete state tran- 
sition algorithm, K-circle, Double R-probability 
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^ ' 1 Introduction 

o 

r~^ I Generally speaking, GTSP can be described as follow: given a completely undi- 

rected graph G = {V, E}, vifhere y is a set of n vertices and has been partitioned 
into m clusters V = {T4, V2, ■ • • , Kn}, E is a. set of m edges, and the goal of 

^^ . GTSP is to find a tour visiting each cluster exactly once while minimizing the 

sum of the route costs. In this paper, the symmetric GTSP is concerned, that 
is to say, Cij = Cj^i, here, the associated cost Ci,j for each pair of vertices (J, j) 
represents the distance from one vertex in Vi to another vertex in Vj . Since each 

^\ . cluster has at least one vertex and each vertex can only belong to one cluster, 

H I we have m < n. If m = n, GTSP is restored to TSP which together with GTSP 

- ■ ■ are both NP-hard problems [1]. To deal with TSP, we only need to optimize the 

sequence of the clusters, while in the process of solving GTSP we must determine 
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the sequence of the clusters and a vertex to be visited in each cluster simulta- 
neously, which indicates that GTSP is more complex than TSP. Nonetheless, 
GTSP is extensively used in many applications, such as task scheduling, airport 
selection and postal routing, etc[2,3]. 

According to the characteristics of GTSP, we can decompose the process of 
solving GTSP into two phases. One is to determine the visiting order of all the 
clusters, which is similar to TSP; the other is to find the optimal vertex in each 
cluster in a given order. Many reputed heuristic searching algorithms, like genetic 
algorithm (GA)[4] , particle swarm optimization (PS0)[5], simulated annealing 
(SA)[6], ant colony optimization (AGO) [7], have been varied into discrete ver- 
sions to solve GTSP. Though these algorithms have their own mechanisms to 
deal with continuous optimization problems, they have to adapt themselves to 
GTSP with some classic operators, such as swap and insert. These search opera- 
tors change the visiting order of clusters in particular ways. Lin-Kernighan(L-K) 
is a well-known method to solve TSP and GTSP [8], which focuses on changing 
the edges instead of the visiting order of clusters. The number of edges that L-K 
impacts in a single operation is unknown; as as result, the depth of L-K is usually 
limited within a constant [9]. The majority of these methods focus on finding an 
optimal sequence of clusters, while to solve GTSP, it still has to choose a vertex 
from each cluster to make the minimal cost simultaneously. This is a well-known 
shortest path problem in operations research which is also called cluster opti- 
mization (CO) in GTSP. The most common method to deal with this problem is 
dynamic programming that can give us a definitively best result which is named 
as layer network method in other literatures[10]. 

State transition algorithm is a new optimization algorithm, according to the 
control theory and state transition[ll]. The efficiency of STA in application to 
continuous optimization problems has been proved[12,13]. In [14,15], Discrete 
version of state transition algorithm has been introduced to solve a series of 
discrete optimization problems such as TSP, boolean integer programming. In 
this study, we wiU extend DSTA to solve the GTSP. 

In section 2, we give a brief description of DSTA and some transformation 
operators. Section 3 introduces relevancy and correlation index to describe K- 
Neighbor. In section 4, a DSTA is presented to solve GTSP with a new updating 
mechanism. Some experimental results are given in section 5, and the final part 
is the conclusion. 



2 Discrete State Transition Algorithm 

2.1 Description of DSTA 

State transition algorithm comes from control theory. It regards a solution to an 
optimization problem as a state and updating of the solution as state transition. 
The unified form of discrete state transition algorithm is given as follow: 

2/fc+i = fixk+i) 
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Where, Xk € K" denotes a current state, corresponding to current solution of an 
optimization problem; Uk G M" is a function of x^ and historical states; both Ak, 
Bk G R"^" are transition operators which are usually state transition matrixes; 
^ is an operation, which is admissible to operate on two states; / is the cost 
function or evaluation function. 

In general, the solution to discrete optimization problem is a sequence, which 
means a new state aj^-i^i should also be a sequence after transformation by A^ 
or Bk- For the TSP, only a state transition matrix is considered, avoiding the 
complexity of adding one sequence to another. So the form of DSTA for TSP is 
simplified as follow: 

Xk+i = GkXk 

Vk+i = .f{xk+i) 



(2) 



where, x^ = [xi^k-,X2,k-, ■■■ , a^m,fc]^, Xi^k G {1, 2, • • • , m}; Gk is the state tran- 
sition matrix which is created by transformation operators. State transition ma- 
trixes are variants of identity matrix with only position value 1 in each column 
and each row. Multiplying a state transition matrix by a current state will get 
a new state which is still a sequence and the process is like this: 
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2.2 State transformation operators 

DSTA for TSP works with 3 efficient operators, which is the foundation of study 
on GTSP. All of the 3 operators, swap, shift, symmetry, belong to Gk ■ 

1) Swap 

yXi^ X2j X^j ' • ' J Xi—ij Xij Xi-\-i^ • ' • jXm—ljXm) 
r t^^l, Xi-, X3, ■ • ■ , Xi—i^ X2, ^z+l, ■ ' ■ , ^m— 1; XjYij 

This is an operator to exchange several vertices in the tour and the number 
of the vertices to be changed is limited by a parameter nia- With this operator, 
the number of edges to be changed is twice as that of vertices to be exchanged. 

2) Shift 

\X\^ X2j X35 ' ' ' J Xi—i , Xij Xi-\-i , ■ • • , Xm—l , Xm) 
7 \X\^ X3, ■ • ■ , X^_i , Xj, X2 , Xi-\-\ , ■ • ■ , Xjyi — \ , XjYij 

This operator first removes a segment of sequence from a given tour and 
then inserts this segment into a random position of the remaining sequence. The 
length of the removed sequence is restricted to less than mi,. Three edges will 
be changed through this operator, of which two edges are adjacent and the last 
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edge is non-adjacent to them. 

3) Symmetry 

yXi^X2^''' ^ Xi—^^ Xi—2-, Xx — 1t Xi^ Xi-^i^ Xx-\-2^ -^x+S ' ' ' j-^ni) 
? \Xi , ^2 , ■ ■ ■ , X2_3 , X 1-1^2 1 '^a^+li "^i^ •^i~l^ •^x — 2 i ^a;-f3 ' ' ' 5 •^ra) 

Symmetry is a unique operator in STA, which is to choose a vertex in a given 
tour as center, and then mirror a small segment on the left side of the center to 
the opposite side and so does a small segment on the right side. The length of 
these small segments is restricted to rric. For symmetric GTSP, the symmetry 
operator can change two edges every time. 

4) Circle 

At the final stage of search process, a tour is usually locally optimal, which 
indicates that this tour has many similar segments to the global tour; therefore, 
the entire sequence can be regarded as the combination of these segments. To 
further optimize a tour, we have to change several conjoint vertices in some 
segments simultaneously. Considering that the number of changing vertices in 
swap or shift is no more than 2, a new operator called circle is proposed here to 
enhance the global search ability. 

Circle consists of two steps. First, we divide a given tour into two circles 
randomly, and then break one of the circles and insert it into another to create 
a new complete tour. Effects of this operator can be summarized as follow: 

1. One of the circles contains only one vertex(FiglTJa); 

2. Both of the circles contain more than one vertex, and change the connection 
at the interfaces of each circle(FiglT]b); 

3. Both of the circles contain more than one vertex, break one of the circles 
at its interface and insert it into another from a random position except the 
interface(FiglTJc, FiglTJd). 

4. Both of the circles contain more than one vertex, break one of the circles 
and insert it into another. Both of the breaking position and inserting position 
are randomly chosen except two interfaces. FiglUe and FiglHf show the results 
of this case. 

Obviously, circle is much more flexible than the other operators since it can 
gain 6 different kinds of cases. 

5) Cluster optimization(CO) 

This is a sole operator to find the best path of given visiting order of clusters. 
A tour [xi.fc, X2.fc, • • • , Xm.k] with costs W{xk) will be optimized into a new tour 
Xp, after running CO, here, W{x-^) < W{xk) and cluster{x^) ~ cluster{xk)- In 
general, few of visiting orders of clusters will be changed from Xk to Xj. , thus we 
only need to optimize a small segment around the changed clusters. 

3 K-Neighbor 

To improve the global search ability for large-scale problems in limited compu- 
tational time, it is necessary to avoid some potential bad search space. In this 
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Fig. 1. Effects of circle operator in different cases 



paper, the correlation index and relevancy are proposed, where correlation index 
is used to assess the correlation of every two clusters and the relevancy is applied 
to define K-Ncighbor which will guide search direction as heuristic information. 
Definition 1: Let define the distance between the geometric centers of cluster i 
and cluster j as d^j-, the sum of distance from the geometric centers of cluster i 
to the geometric centers of other clusters as di and denote r^ j as the correlation 
index of cluster i to cluster j : 



1 



dij 



n-1 ' 



(3) 



Y. ""^'1 






Definition 2: Given ri,j as the correlation index of cluster i to cluster j and 
Tj^i as the correlation index of cluster j to cluster i, then the relevancy pij of 
cluster i to cluster j can be formulated as: 



K,i 



E Tu, 



(4) 



j=i 



Calculating each pij , we can get a relevancy matrix. The ith row of the relevancy 
matrix shows the relevancy of cluster i to other clusters. A big pi^j indicates 
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cluster i is with high possibility connecting with cluster j. After sorting the 
relevancy matrix in descending order by row, the top k clusters in row i will 
be the K-Neighbor of cluster i. Using K-Neighbor as heuristic information, the 
global search ability can be improved significantly. 

4 DSTA for GTSP 

To adapt DSTA to GTSP and to make the algorithm more efficient, K-Neighbor 
is used as heuristic information to guide the search. Thus, fc-shift, fc-symmetry 
and /c-circle which are all guided by K-Neighbor are included in DSTA. The core 
procedure of the DSTA for GTSP can be outlined in pseudocode as follows: 



repeat 



[Best, Best*] <— swa,j>{SE, Best, Best*, nia) 
[Best, Best*] ^ shiit{SE, Best, Best*, mb) 
[Best, Best*] ^ k-ciTclc{SE, Best, Best* , K-Neighbor) 
[Best, Best*] <— k-sym.xn.etTy{SE, Best, Best* , K-Neighbor) 
[Best, Best*] ^ k-shiit{SE, Best, Best*, K-Neighbor) 
until the specified termination criterion is met 
where, SE is the search enforcement, representing the times of transformation 
by a certain operator; Best is the best solution from the candidate state set 
created by transformation operators; Best* is the best solution in history. There 
are five operators in DSTA to optimize the sequence of clusters. A short cluster 
optimization which only optimizes a small segment (no more than 5 vertices) 
around the changed vertices is contained in every transformation operator to find 
a minimum path of a given sequence in further. To escape from local minima, 
a new robust update mechanism, restore and risk in probability, called double 
R-Probability for short, is introduced. Risk in probability is to accept a bad 
solution with a probability pi. To ensure the convergence of DSTA, restore in 
probability p2 is designed to recover the best solution in history. 

5 Computational Results 

Instances used in this paper all come from GTSPLIB[16]. The number of clusters 
in these instances varies from 30 to 89. All of the algorithms, including DSTA, 
SA and AGO, are coded in matlab and run on an Intel Gore i5 3.10GHz under 
Window XP environment. In order to test the performance of the proposed 
operators and approach, DSTA is compared with SA and AGO, and 10 runs are 
carried out for the experiment. Some statistics arc used as follows 

Opt. : the best known solution. 

Best: the best solution obtained from the experiment, 

Aavg '■ the relative error of the average solution, 

mean(values) — Opt. ^ „„^ 

^avg = ^ „ / — X 100% 

Opt. 
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average time consumed. 



Results of comparison among DSTA, SA and AGO are listed in Table [TJ In 
DSTA, we set k = 8, nia = 2, mi, ~ 1. The initial temperature in SA is 5000 and 
the cooling rate is 0.97. In AGO, a — 1, /3 = 5, p = 0.95, where, a, (3 are used to 
control the relative weight of pheromone trail and heuristic value, and p is the 
pheromone trail decay coefficient. As can be seen from Tablc[TJ DSTA is superior 
to SA and AGO in both time consumption and solution quality. The Aavg of 
DSTA is very small, which indicates DSTA has good robustness and can obtain 
good solutions with high probability. In the 10 runs, DSTA obtains the optimal 
solution at almost each run for every instance, but SA and AGO seldom find 
the Opt. except for 30kroB150. Aavg of SA is smaller than that of AGO because 
SA accepts a bad solution with probability which can help it escape from local 



Table 1. Results of comparison for SA, AGO and DSTA. 
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6 Conclusions 



We added a new operator and heuristic information to DSTA to solve GTSP. 
K-Neighbor can guide the search direction, in a way to ignore all possible connec- 
tions among vertices. A flexible operator /c-circlc is guided by the K-Ncighbor, 
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which can change random segments freely in a tour. Double R-Possibility is help- 
ful to escape from local minima. It accepts a bad solution with a probability pi 
and restore the history best with another probability p2. All these strategies 
contribute to improving the performance of the DSTA. 
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